Using the R Package crlmm for Genotyping and Copy Number Estimation.

نویسندگان

  • Robert B Scharpf
  • Rafael A Irizarry
  • Matthew E Ritchie
  • Benilton Carvalho
  • Ingo Ruczinski
چکیده

Genotyping platforms such as Affymetrix can be used to assess genotype-phenotype as well as copy number-phenotype associations at millions of markers. While genotyping algorithms are largely concordant when assessed on HapMap samples, tools to assess copy number changes are more variable and often discordant. One explanation for the discordance is that copy number estimates are susceptible to systematic differences between groups of samples that were processed at different times or by different labs. Analysis algorithms that do not adjust for batch effects are prone to spurious measures of association. The R package crlmm implements a multilevel model that adjusts for batch effects and provides allele-specific estimates of copy number. This paper illustrates a workflow for the estimation of allele-specific copy number and integration of the marker-level estimates with complimentary Bioconductor software for inferring regions of copy number gain or loss. All analyses are performed in the statistical environment R.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

R/Bioconductor software for Illumina's Infinium whole-genome genotyping BeadChips

UNLABELLED Illumina produces a number of microarray-based technologies for human genotyping. An Infinium BeadChip is a two-color platform that types between 10(5) and 10(6) single nucleotide polymorphisms (SNPs) per sample. Despite being widely used, there is a shortage of open source software to process the raw intensities from this platform into genotype calls. To this end, we have developed ...

متن کامل

Genotyping with the crlmm Package

The crlmm package contains a new implementation for the CRLMM algorithm (Carvalho et. al. 2007). Our focus is on efficient genotyping of SNP 5.0 and 6.0 Affymetrix arrays, although extensions of the method are under development for similar platforms. This implementation, compared to the previous one (in oligo), offers improved confidence scores, quality scores for SNP’s and batches, higher accu...

متن کامل

VanillaICE : Hidden Markov Models for the Assessment of Chromosomal Alterations using High-throughput SNP Arrays

The starting point for this section of the vignette are B allele frequencies and log R ratios that are available from software such as GenomeStudio and the R package crlmm. In this section, we assume that the low-level summaries are available in a plain text file – one file per sample. For users of the crlmm package for preprocessing, please refer to the crlmmDownstream vignette. To illustrate ...

متن کامل

A multilevel model to address batch effects in copy number estimation using SNP arrays.

Submicroscopic changes in chromosomal DNA copy number dosage are common and have been implicated in many heritable diseases and cancers. Recent high-throughput technologies have a resolution that permits the detection of segmental changes in DNA copy number that span thousands of base pairs in the genome. Genomewide association studies (GWAS) may simultaneously screen for copy number phenotype ...

متن کامل

Detection of de novo copy number alterations in case-parent trios using the R package MinimumDistance

For the analysis of case-parent trio genotyping arrays, copy number variants (CNV) appearing in the offspring that differ from the parental copy numbers are often of interest (de novo CNV). This package defines a statistic, referred to as the minimum distance, for identifying de novo copy number alterations in the offspring. We smooth the minimum distance using the circular binary segmentation ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Journal of statistical software

دوره 40 12  شماره 

صفحات  -

تاریخ انتشار 2011